20 research outputs found

    READ-BAD: A New Dataset and Evaluation Scheme for Baseline Detection in Archival Documents

    Full text link
    Text line detection is crucial for any application associated with Automatic Text Recognition or Keyword Spotting. Modern algorithms perform well on well-established datasets since these either comprise clean data or simple/homogeneous page layouts. We have collected and annotated 2036 archival document images from different locations and time periods. The dataset contains varying page layouts and degradations that challenge text line segmentation methods. Well-established text line segmentation evaluation schemes such as the Detection Rate or Recognition Accuracy demand binarized data that is annotated on a pixel level. Producing ground truth by these means is laborious and not needed to determine a method's quality. In this paper we propose a new evaluation scheme that is based on baselines. The proposed scheme requires no binarization and can handle skewed as well as rotated text lines. The ICDAR 2017 Competition on Baseline Detection and the ICDAR 2017 Competition on Layout Analysis for Challenging Medieval Manuscripts used this evaluation scheme. Finally, we present results achieved by a recently published text line detection algorithm. Comment: Submitted to DAS201
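The baseline-based evaluation above can be sketched as a precision/recall measure over baseline polylines. The following is a simplified illustration, not the exact cBAD scheme (which uses per-line tolerances derived from the ground truth); the fixed tolerance and function names here are assumptions for demonstration.

```python
# Simplified sketch of baseline-based evaluation: baselines are polylines
# given as lists of (x, y) points; a point counts as covered if it lies
# within a tolerance of any baseline in the other set.
import math

def point_to_segment_dist(p, a, b):
    """Euclidean distance from point p to line segment a-b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    if dx == 0 and dy == 0:
        return math.hypot(px - ax, py - ay)
    t = max(0.0, min(1.0, ((px - ax) * dx + (py - ay) * dy) / (dx * dx + dy * dy)))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))

def coverage(points, baselines, tol):
    """Fraction of points lying within tol of any baseline segment."""
    hits = 0
    for p in points:
        d = min(point_to_segment_dist(p, a, b)
                for line in baselines for a, b in zip(line, line[1:]))
        if d <= tol:
            hits += 1
    return hits / len(points)

def f_value(pred, gt, tol=5.0):
    """Harmonic mean of baseline precision and recall."""
    recall = coverage([p for line in gt for p in line], pred, tol)
    precision = coverage([p for line in pred for p in line], gt, tol)
    return 2 * precision * recall / (precision + recall) if precision + recall else 0.0
```

A perfect prediction yields an F-value of 1.0; baselines farther than the tolerance from all ground-truth lines contribute nothing.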

    Albiglutide and cardiovascular outcomes in patients with type 2 diabetes and cardiovascular disease (Harmony Outcomes): a double-blind, randomised placebo-controlled trial

    Get PDF
    Background: Glucagon-like peptide 1 receptor agonists differ in chemical structure, duration of action, and in their effects on clinical outcomes. The cardiovascular effects of once-weekly albiglutide in type 2 diabetes are unknown. We aimed to determine the safety and efficacy of albiglutide in preventing cardiovascular death, myocardial infarction, or stroke. Methods: We did a double-blind, randomised, placebo-controlled trial in 610 sites across 28 countries. We randomly assigned patients aged 40 years and older with type 2 diabetes and cardiovascular disease (at a 1:1 ratio) to groups that either received a subcutaneous injection of albiglutide (30–50 mg, based on glycaemic response and tolerability) or of a matched volume of placebo once a week, in addition to their standard care. Investigators used an interactive voice or web response system to obtain treatment assignment, and patients and all study investigators were masked to their treatment allocation. We hypothesised that albiglutide would be non-inferior to placebo for the primary outcome of the first occurrence of cardiovascular death, myocardial infarction, or stroke, which was assessed in the intention-to-treat population. If non-inferiority was confirmed by an upper limit of the 95% CI for a hazard ratio of less than 1·30, closed testing for superiority was prespecified. This study is registered with ClinicalTrials.gov, number NCT02465515. Findings: Patients were screened between July 1, 2015, and Nov 24, 2016. 10 793 patients were screened and 9463 participants were enrolled and randomly assigned to groups: 4731 patients were assigned to receive albiglutide and 4732 patients to receive placebo. On Nov 8, 2017, it was determined that 611 primary endpoints and a median follow-up of at least 1·5 years had accrued, and participants returned for a final visit and discontinuation from study treatment; the last patient visit was on March 12, 2018. 
These 9463 patients, the intention-to-treat population, were evaluated for a median duration of 1·6 years and were assessed for the primary outcome. The primary composite outcome occurred in 338 (7%) of 4731 patients at an incidence rate of 4·6 events per 100 person-years in the albiglutide group and in 428 (9%) of 4732 patients at an incidence rate of 5·9 events per 100 person-years in the placebo group (hazard ratio 0·78, 95% CI 0·68–0·90), which indicated that albiglutide was superior to placebo (p<0·0001 for non-inferiority; p=0·0006 for superiority). The incidence of acute pancreatitis (ten patients in the albiglutide group and seven patients in the placebo group), pancreatic cancer (six patients in the albiglutide group and five patients in the placebo group), medullary thyroid carcinoma (zero patients in both groups), and other serious adverse events did not differ between the two groups. Investigators, who were masked to study drug assignment, assessed three (<1%) deaths in the placebo group and two (<1%) in the albiglutide group as treatment-related. Interpretation: In patients with type 2 diabetes and cardiovascular disease, albiglutide was superior to placebo with respect to major adverse cardiovascular events. Evidence-based glucagon-like peptide 1 receptor agonists should therefore be considered as part of a comprehensive strategy to reduce the risk of cardiovascular events in patients with type 2 diabetes. Funding: GlaxoSmithKline
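The prespecified closed testing procedure described above can be sketched in a few lines: non-inferiority is declared if the upper limit of the 95% CI for the hazard ratio falls below the margin of 1·30, and only then is superiority tested (upper limit below 1). The function name is a hypothetical illustration, not from the trial protocol.

```python
# Sketch of the closed testing procedure: test non-inferiority against the
# prespecified margin first; superiority may only be claimed if
# non-inferiority holds and the upper CI limit is also below 1.0.
def closed_test(hr_ci_upper, margin=1.30):
    non_inferior = hr_ci_upper < margin
    superior = non_inferior and hr_ci_upper < 1.0
    return non_inferior, superior
```

With the trial's reported upper CI limit of 0·90, both non-inferiority and superiority are declared, matching the published finding.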

    Transforming scholarship in the archives through handwritten text recognition:Transkribus as a case study

    Get PDF
    Purpose: An overview of the current use of handwritten text recognition (HTR) on archival manuscript material, as provided by the EU H2020 funded Transkribus platform. It explains HTR, demonstrates Transkribus, gives examples of use cases, highlights the effect HTR may have on scholarship, and evidences this turning point in the advanced use of digitised heritage content. The paper aims to discuss these issues. - Design/methodology/approach: This paper adopts a case study approach, using the development and delivery of the only openly available HTR platform for manuscript material. - Findings: Transkribus has demonstrated that HTR is now a usable technology that can be employed in conjunction with mass digitisation to generate accurate transcripts of archival material. Use cases are demonstrated, and a cooperative model is suggested as a way to ensure sustainability and scaling of the platform. However, funding and resourcing issues are identified. - Research limitations/implications: The paper presents results from projects: further user studies could be undertaken involving interviews, surveys, etc. - Practical implications: Only HTR provided via Transkribus is covered: however, this is the only publicly available platform for HTR on individual collections of historical documents at the time of writing and it represents the current state of the art in this field. - Social implications: The increased access to information contained within historical texts has the potential to be transformational for both institutions and individuals. - Originality/value: This is the first published overview of how HTR is used by a wide archival studies community, reporting and showcasing current application of handwriting technology in the cultural heritage sector.

    Novel methods for writer identification and retrieval

    No full text
    Writer identification is the task of identifying the writer of a handwritten document, based on a set of documents whose authors are known. It can be used, for example, in forensics and for historical document analysis. Writer retrieval, in contrast, produces a ranking of the pages in a document set according to the similarity of the handwriting, and can be used to cluster an unindexed set of documents by individual hand. State-of-the-art methods calculate features on the contours of the characters, so pre-processing steps are needed to extract these contours. In contrast, this thesis presents three novel approaches for writer identification and writer retrieval. The first is based on the bag-of-words approach, which is well known from object recognition: SIFT features are calculated on the handwriting and an occurrence histogram is generated, which is then used to identify the writer. The second method is based on the Fisher vector: again, SIFT features are computed on the handwriting, but this time the gradient vectors of a Gaussian Mixture Model (GMM) are used to generate the feature vector for writer identification. The last method is based on a Convolutional Neural Network (CNN): a CNN is trained on image patches, the classification layer is cut off, and the second-to-last layer is used as the feature vector for each patch; the mean vector over all patches of a page is the feature vector for the handwriting and is used for identification and retrieval. The methods presented are evaluated and compared to the state of the art on several scientific databases and additionally on a historic dataset, using common evaluation metrics for writer identification. The evaluations show that the three proposed methods outperform the state of the art on many of the tasks on these datasets. Advantages and possible weaknesses are discussed. 
The proposed methods achieve good results (>90%) on every dataset used for evaluation.
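The bag-of-words step in the first method can be sketched as follows, assuming SIFT descriptors have already been extracted and a codebook of visual words has been learned offline (e.g. by k-means); the function name and shapes are illustrative assumptions.

```python
# Minimal sketch of the bag-of-words occurrence histogram: each descriptor
# votes for its nearest codeword, and the normalized histogram serves as the
# page-level feature vector describing the handwriting.
import numpy as np

def bow_histogram(descriptors, codebook):
    """descriptors: (n, d) array; codebook: (k, d) array -> (k,) histogram."""
    # squared Euclidean distance between every descriptor and every codeword
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)                  # nearest codeword per descriptor
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / hist.sum()                   # L1-normalize
```

Histograms of two pages can then be compared with any vector distance (e.g. cosine) to rank pages by handwriting similarity.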

    Learning Features for Writer Retrieval and Identification using Triplet CNNs

    No full text
    The final publication is available via https://doi.org/10.1109/ICFHR-2018.2018.00045. This paper presents a method for writer retrieval and identification using a feature descriptor learned by a Convolutional Neural Network. Instead of using a network for classification, we propose the use of a triplet network that learns a similarity measure for image patches. Patches of the handwriting are extracted and mapped into an embedding where this similarity measure is defined by the L2 distance. The triplet network is trained by maximizing the interclass distance while minimizing the intraclass distance in this embedding. The image patches are encoded using the learned feature descriptor. By applying the Vector of Locally Aggregated Descriptors encoding to these features, we generate a feature vector for each document image. A detailed parameter evaluation is given, which shows that this method achieves a mean average precision of 86.1% on the ICDAR 2013 writer identification dataset, but future work has to be done to improve the performance on historic datasets. In addition, the strategy for clustering the feature space is investigated. European Union's Horizon 2020.
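The VLAD encoding step named above can be sketched as follows, assuming patch features and a codebook of cluster centers obtained offline; the power and L2 normalizations shown are common VLAD practice, not details confirmed by this abstract.

```python
# Sketch of VLAD (Vector of Locally Aggregated Descriptors): for each
# cluster, sum the residuals of its assigned features; the concatenated,
# normalized residual vectors form the document descriptor.
import numpy as np

def vlad(features, centers):
    """features: (n, d); centers: (k, d) -> (k*d,) L2-normalized VLAD vector."""
    d2 = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    assign = d2.argmin(axis=1)                 # nearest center per feature
    v = np.zeros_like(centers)
    for i, c in enumerate(assign):
        v[c] += features[i] - centers[c]       # accumulate residuals
    v = v.ravel()
    v = np.sign(v) * np.sqrt(np.abs(v))        # power normalization
    n = np.linalg.norm(v)
    return v / n if n > 0 else v
```

Two document descriptors built this way can be compared with the L2 distance to rank pages for retrieval.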

    Word Beam Search: A Connectionist Temporal Classification Decoding Algorithm

    No full text
    The final publication is available via https://doi.org/10.1109/ICFHR-2018.2018.00052. Recurrent Neural Networks (RNNs) are used for sequence recognition tasks such as Handwritten Text Recognition (HTR) or speech recognition. If trained with the Connectionist Temporal Classification (CTC) loss function, the output of such an RNN is a matrix containing character probabilities for each time-step. A CTC decoding algorithm maps these character probabilities to the final text. Token passing is such an algorithm and is able to constrain the recognized text to a sequence of dictionary words. However, the running time of token passing depends quadratically on the dictionary size, and it is not able to decode arbitrary character strings such as numbers. This paper proposes word beam search decoding, which is able to tackle these problems. It constrains words to those contained in a dictionary, allows arbitrary non-word character strings between words, optionally integrates a word-level language model, and has a better running time than token passing. The proposed algorithm outperforms best path decoding, vanilla beam search decoding, and token passing on the IAM and Bentham HTR datasets. An open-source implementation is provided. European Union's Horizon 2020.
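For context, the simplest of the baselines compared above, best path decoding, can be sketched in a few lines: take the most probable character per time-step, collapse repeats, then drop blanks. Word beam search additionally constrains the output words to a dictionary; that logic is not shown here.

```python
# Sketch of CTC best path (greedy) decoding. mat is a list of per-time-step
# probability lists over alphabet indices; index `blank` is the CTC blank.
def best_path_decode(mat, alphabet, blank=0):
    best = [max(range(len(row)), key=row.__getitem__) for row in mat]
    out, prev = [], None
    for idx in best:
        if idx != prev and idx != blank:       # collapse repeats, drop blanks
            out.append(alphabet[idx])
        prev = idx
    return "".join(out)
```

Because decoding is per-time-step, best path decoding cannot recover from locally wrong maxima, which is what beam-search-style decoders (including word beam search) improve upon.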

    Mass Digitization of Archival Documents using Mobile Phones

    No full text
    The final publication is available via https://doi.org/10.1145/3151509.3151526. Digital copies of historical documents are needed for the Digital Humanities. Currently, cameras of standard mobile phones are able to capture documents with a resolution of about 330 dpi for document sizes up to DIN A4 (German standard, 297 x 210 mm), which allows a digitization of documents using a standard device. Thus, scholars are able to take images of documents in archives themselves without the need for book scanners or other devices. This paper presents a scanning app which comprises real-time page detection, quality assessment (focus measure), and automated detection of page turn-overs when books are scanned. Additionally, a portable device - the ScanTent - on which to place the mobile phone during scanning is presented. The page detection is evaluated on the ICDAR2015 SmartDoc competition dataset and shows a reliable page detection with an average Jaccard index of 75%. European Union's Horizon 2020.
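The Jaccard index used to evaluate page detection is intersection over union of the detected and ground-truth page regions. A minimal sketch for axis-aligned boxes (the competition compares arbitrary quadrilaterals, so this is a simplification):

```python
# Jaccard index (intersection over union) for axis-aligned boxes given as
# (x0, y0, x1, y1) with x0 < x1 and y0 < y1.
def jaccard(a, b):
    ix0, iy0 = max(a[0], b[0]), max(a[1], b[1])
    ix1, iy1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix1 - ix0) * max(0, iy1 - iy0)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)
```

A value of 1.0 means the detected page exactly matches the ground truth; 0.0 means no overlap.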

    cBAD: ICDAR2017 Competition on Baseline Detection

    Get PDF
    The final publication is available via https://doi.org/10.1109/ICDAR.2017.222. The cBAD competition aims at benchmarking state-of-the-art baseline detection algorithms. It is in line with previous competitions such as the ICDAR 2013 Handwriting Segmentation Contest. A new, challenging dataset was created to test the behavior of state-of-the-art systems on real-world data. Since traditional evaluation schemes are not applicable to the size and modality of this dataset, we present a new one that introduces baselines to measure performance. We received submissions from five different teams for both tracks. European Union's Horizon 2020.

    ICDAR2017 Competition on Historical Document Writer Identification (Historical-WI)

    Get PDF
    The final publication is available via https://doi.org/10.1109/ICDAR.2017.225. The ICDAR 2017 Competition on Historical Document Writer Identification is dedicated to recording the most recent advances made in the field of writer identification. The goal of the writer identification task is the retrieval of pages which have been written by the same author. The test dataset used in this competition consists of 3600 handwritten pages originating from the 13th to the 20th century. It contains manuscripts from 720 different writers, where each writer contributed five pages. Five different institutions submitted six methods, which were ranked using identification and retrieval metrics. The paper describes the competition details, including the dataset and the evaluation measures used, as well as a short description of each submitted method. European Union's Horizon 2020.
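Mean average precision (mAP) is a standard retrieval metric of the kind used to rank such methods: for each query, precision is averaged over the ranks at which relevant pages appear, then averaged over all queries. A minimal sketch (function names are illustrative):

```python
# Average precision for one query: ranked_relevance is a list of booleans,
# True where the retrieved page was written by the query's author.
def average_precision(ranked_relevance):
    hits, precisions = 0, []
    for rank, rel in enumerate(ranked_relevance, start=1):
        if rel:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0

def mean_average_precision(queries):
    """Mean of the per-query average precisions."""
    return sum(average_precision(q) for q in queries) / len(queries)
```

A method that places all relevant pages at the top of every ranking scores a mAP of 1.0; relevant pages pushed down the ranking lower the score.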